Cluster Analysis: A Toolbox for MATLAB

نویسندگان

  • Lawrence Hubert
  • Douglas Steinley
چکیده

A broad definition of clustering can be given as the search for homogeneous groupings of objects based on some type of available data. There are two common such tasks now discussed in (almost) all multivariate analysis texts and implemented in the commercially available behavioral and social science statistical software suites: hierarchical clustering and the K-means partitioning of some set of objects. This chapter begins with a brief review of these topics using two illustrative data sets that are carried along throughout this chapter for numerical illustration. Later sections will develop hierarchical clustering through least-squares and the characterizing notion of an ultrametric; K-means partitioning is generalized by rephrasing as an optimization problem of subdividing a given proximity matrix. In all instances, the MATLAB computational environment is relied on to effect our analyses, using the Statistical Toolbox, for example, to carry out the common hierarchical clustering and K-means methods, and our own open-source MATLAB M-files when the extensions go beyond what is currently available commercially (the latter are freely available as a Toolbox from www.cda.psych.uiuc.edu/clusteranalysis_mfiles). Also, to maintain a reasonable printed size for the present handbook contribution, the table of contents, figures, and tables for the full chapter, plus the final section and the header comments for the M-files in Appendix A, are available from www.cda.psych.uiuc.edu/cluster_analysis_parttwo.pdf

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A MATLAB Toolbox and its Web based Variant for Fuzzy Cluster Analysis

Nowadays due to the yearly multiplying data comes always the need for useful methods, algorithms, that make the processing of these data easier. For the solution of this problem data mining tools come into existence, to which clustering algorithms belong. The purpose of this paper is to propose a continuously extensible, standard tool, which is useful for any MATLAB user for one’s aim. The tool...

متن کامل

Parallel Spatial Pyramid Match Kernel Algorithm for Object Recognition using a Cluster of Computers

This paper parallelizes the spatial pyramid match kernel (SPK) implementation. SPK is one of the most usable kernel methods, along with support vector machine classifier, with high accuracy in object recognition. MATLAB parallel computing toolbox has been used to parallelize SPK. In this implementation, MATLAB Message Passing Interface (MPI) functions and features included in the toolbox help u...

متن کامل

A Matlab toolbox for grey clustering and fuzzy comprehensive evaluation

In this article, we propose totally new grey clustering method and fuzzy comprehensive evaluation method and accordingly, a Matlab toolbox for grey clustering statistic and fuzzy comprehensive evaluation is developed. As an illustrative example, we use the toolbox developed for carrying on an analysis of the test scores of the Grade 3 at the National Changhua Girls’ Senior High School, Taiwan. ...

متن کامل

MLC Toolbox: A MATLAB/OCTAVE Library for Multi-Label Classification

Multi-Label Classification toolbox is a MATLAB/OCTAVE library for Multi-Label Classification (MLC). There exists a few Java libraries for MLC, but no MATLAB/OCTAVE library that covers various methods. This toolbox offers an environment for evaluation, comparison and visualization of the MLC results. One attraction of this toolbox is that it enables us to try many combinations of feature space d...

متن کامل

Cluster Analysis and Genetic Algorithms

The paper deals with the cluster analysis and genetic algorithms and describes their basis. The application of genetic algorithms is focused on a cluster analysis as an optimization task. The case studies present the way of solution of two and three dimensional cluster analysis in MATLAB program with use of the Genetic Algorithm and Direct Search Toolbox. The way of its possible use in business...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008